Search CORE

119 research outputs found

Can Clustal-style progressive pairwise alignment of multiple sequences be used in RNA secondary structure prediction?

Author: Amelia B Bellamy-Royds
B Masoumi
D Gautheret
D Mathews
D Sankoff
DH Mathews
DH Mathews
G Pavesi
G Storz
IL Hofacker
IL Hofacker
J Felsenstein
J Gorodkin
JD Thompson
JH Havgaard
JJ Cannone
KJ Doshi
M Anwar
M Sprinzl
M Zuker
M Zuker
M Zuker
Marcel Turcotte
P Rice
PP Gardner
PP Gardner
R Gutell
RD Dowell
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background In ribonucleic acid (RNA) molecules whose function depends on their final, folded three-dimensional shape (such as those in ribosomes or spliceosome complexes), the secondary structure, defined by the set of internal basepair interactions, is more consistently conserved than the primary structure, defined by the sequence of nucleotides. Results The research presented here investigates the possibility of applying a progressive, pairwise approach to the alignment of multiple RNA sequences by simultaneously predicting an energy-optimized consensus secondary structure. We take an existing algorithm for finding the secondary structure common to two RNA sequences, Dynalign, and alter it to align profiles of multiple sequences. We then explore the relative successes of different approaches to designing the tree that will guide progressive alignments of sequence profiles to create a multiple alignment and prediction of conserved structure. Conclusion We have found that applying a progressive, pairwise approach to the alignment of multiple ribonucleic acid sequences produces highly reliable predictions of conserved basepairs, and we have shown how these predictions can be used as constraints to improve the results of a single-sequence structure prediction algorithm. However, we have also discovered that the amount of detail included in a consensus structure prediction is highly dependent on the order in which sequences are added to the alignment (the guide tree), and that if a consensus structure does not have sufficient detail, it is less likely to provide useful constraints for the single-sequence method.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Separate Origins of Group I Introns in Two Mitochondrial Genes of the Katablepharid Leucocryptos marina

Author: A Stamatakis
AS Tanabe
B Auer
BL Stoddard
CH Sellem
E Kim
I Masuda
JC Vaughn
JJ Cannone
L Sandegren
M Belfort
M Korab-Laskowska
M Zuker
MR Goddard
MV Sanchez Puerta
MV Sanchez-Puerta
N Lartillot
N Okamoto
P Haugen
PL Adams
R Kamikawa
R Saldanha
Ryoma Kamikawa
S Johansen
SQ Le
Stuart Alexander Ralph
T Muller
Tetsuo Hashimoto
Y Cho
Y Inagaki
Yuji Inagaki
Yuki Nishimura
Publication venue: Public Library of Science
Publication date: 11/05/2012
Field of study

Mitochondria are descendants of the endosymbiotic α-proteobacterium most likely engulfed by the ancestral eukaryotic cells, and the proto-mitochondrial genome should have been severely streamlined in terms of both genome size and gene repertoire. In addition, mitochondrial (mt) sequence data indicated that frequent intron gain/loss events contributed to shaping the modern mt genome organizations, resulting in the homologous introns being shared between two distantly related mt genomes. Unfortunately, the bulk of mt sequence data currently available are of phylogenetically restricted lineages, i.e., metazoans, fungi, and land plants, and are insufficient to elucidate the entire picture of intron evolution in mt genomes. In this work, we sequenced a 12 kbp-fragment of the mt genome of the katablepharid Leucocryptos marina. Among nine protein-coding genes included in the mt genome fragment, the genes encoding cytochrome b and cytochrome c oxidase subunit I (cob and cox1) were interrupted by group I introns. We further identified that the cob and cox1 introns host open reading frames for homing endonucleases (HEs) belonging to distantly related superfamilies. Phylogenetic analyses recovered an affinity between the HE in the Leucocryptos cob intron and two green algal HEs, and that between the HE in the Leucocryptos cox1 intron and a fungal HE, suggesting that the Leucocryptos cob and cox1 introns possess distinct evolutionary origins. Although the current intron (and intronic HE) data are insufficient to infer how the homologous introns were distributed to distantly related mt genomes, the results presented here successfully expanded the evolutionary dynamism of group I introns in mt genomes

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

The complete mitochondrial genome of the oriental fruit moth Grapholita molesta (Busck) (Lepidoptera: Tortricidae)

Author: AC Lessinger
AE Timm
AL Il’ichev
Bao-cai Shi
BS Coates
C Simon
C Simon
DM Borchert
DR Wolstenholme
DX Zhang
ES Lee
F Liao
Fan Zhang
G Hong
I Kim
J Hu
JJ Cannone
JJ Gillespie
JL Boore
JL Boore
JM Cornuet
K Yukuhiro
L Yang
MA Larkin
MH Pan
MN Schnare
MVG Torriani
MY Hong
NC Sheffield
O Niehuis
P Salvato
RG Foottit
S Wei
Shu-jun Wei
SJ Wei
SJ Wei
SJ Wei
SL Cameron
SR Kim
ST Jiang
TM Lowe
W Li
WL Roelofs
XL Hu
Ya-jun Gong
Zong-jiang Kang
Publication venue: Springer Netherlands
Publication date: 01/01/2011
Field of study

The oriental fruit moth, Grapholita molesta (Busck) (Lepidoptera: Tortricidae) currently is one of the economically most destructive pest species of stone and pome fruits worldwide. Here we sequenced the complete mitochondrial genome of this pest. This genome is 15,776 bp long, with an A + T content of 81.24%, containing 37 typical animal mitochondrial genes and an A + T-rich region. All gene are arranged as hypothesized ancestral gene order of insects except for trnM, which was shuffled from 3′ downstream of trnQ to 5′ upstream of trnI. cox1 gene uses unusual CGA start codon, as that in all other sequenced lepidopteran mitochondrial genome. The secondary structures for the two rRNA genes were predicted. All helices typically present in insect mitochondrial rRNA genes are generated. A microsatellite sequence was inserted into the region of H2347 in rrnL in G. molesta and two other sequenced tortricid mitochondrial genomes, indicating that the insertion event in this helix might occurred anciently in family Tortricidae. All of the 22 typical animal tRNA genes have a typical cloverleaf structure except for trnS2, in which the D-stem pairings in the DHU arm are absent. An intergenic sequence is present between trnQ and nad2 as well as in other sequenced lepidopteran mitochondrial genomes, which was presumed to be a remnant of trnM gene and its boundary sequences after the duplication of trnM to the upstream of trnI in Lepidoptera. The A + T-rich region is 836 bp, containing six repeat sequences of “TTATTATTATTATTAAATA(G)TTT.

Crossref

Springer - Publisher Connector

PubMed Central

Tensor Decomposition Reveals Concurrent Evolutionary Convergences and Divergences and Correlations with Structural Motifs in Ribosomal RNA

Author: Andrew M. Gross
BT Wimberly
Chaitanya Muralidhara
CP Ridley
CR Vossbrinck
CR Woese
CR Woese
CR Woese
EA Doherty
EW Sayers
F Schluenzen
FH Crick
G Lentzen
GH Golub
J Cadima
J Isaksson
JH Cate
JI Sagara
JJ Cannone
JM Ogle
L Lancaster
L Omberg
L Omberg
LE Orgel
N Ban
O Alter
O Alter
O Alter
Orly Alter
P Nissen
Purificación López-García
Robin R. Gutell
RP Hirt
RR Gutell
RR Gutell
RR Gutell
S Tavazoie
S Winker
SL Baldauf
SR Eddy
WJ Bock
Publication venue: Public Library of Science
Publication date: 29/04/2011
Field of study

Evolutionary relationships among organisms are commonly described by using a hierarchy derived from comparisons of ribosomal RNA (rRNA) sequences. We propose that even on the level of a single rRNA molecule, an organism's evolution is composed of multiple pathways due to concurrent forces that act independently upon different rRNA degrees of freedom. Relationships among organisms are then compositions of coexisting pathway-dependent similarities and dissimilarities, which cannot be described by a single hierarchy. We computationally test this hypothesis in comparative analyses of 16S and 23S rRNA sequence alignments by using a tensor decomposition, i.e., a framework for modeling composite data. Each alignment is encoded in a cuboid, i.e., a third-order tensor, where nucleotides, positions and organisms, each represent a degree of freedom. A tensor mode-1 higher-order singular value decomposition (HOSVD) is formulated such that it separates each cuboid into combinations of patterns of nucleotide frequency variation across organisms and positions, i.e., “eigenpositions” and corresponding nucleotide-specific segments of “eigenorganisms,” respectively, independent of a-priori knowledge of the taxonomic groups or rRNA structures. We find, in support of our hypothesis that, first, the significant eigenpositions reveal multiple similarities and dissimilarities among the taxonomic groups. Second, the corresponding eigenorganisms identify insertions or deletions of nucleotides exclusively conserved within the corresponding groups, that map out entire substructures and are enriched in adenosines, unpaired in the rRNA secondary structure, that participate in tertiary structure interactions. This demonstrates that structural motifs involved in rRNA folding and function are evolutionary degrees of freedom. Third, two previously unknown coexisting subgenic relationships between Microsporidia and Archaea are revealed in both the 16S and 23S rRNA alignments, a convergence and a divergence, conferred by insertions and deletions of these motifs, which cannot be described by a single hierarchy. This shows that mode-1 HOSVD modeling of rRNA alignments might be used to computationally predict evolutionary mechanisms

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Texas ScholarWorks

RNAcentral 2021: secondary structure integration, improved sequence search and new member databases

Author: Barshir R
Bateman A
Bouchard-Bourelle P
Bruford E
Cannone JJ
Chan PP
dos Santos G
Finn RD
Fishilevich S
Frankish A
Fromm B
Gorodkin J
Griffiths-Jones S
Gutell RR
Hatzigeorgiou AG
Hoksza D
Kalvari I
Karagkouni D
Karlowski WM
Kay S
Kramarz B
Lovering RC
Lowe TM
Lui LM
Ma L
Mani P
Marygold S
Mestdagh P
Mudge JM
Nawrocki EP
Panni S
Peterson KJ
Petrov A
Petrov AS
Porras P
Ramachandran S
Ribas CE
Scott M
Seal R
Seemann SE
Sweeney BA
Szymanski M
Volders P-J
Weinberg Z
Weng S
Zhang Z
Publication venue: OXFORD UNIV PRESS
Publication date: 08/01/2021
Field of study

RNAcentral is a comprehensive database of non-coding RNA (ncRNA) sequences that provides a single access point to 44 RNA resources and >18 million ncRNA sequences from a wide range of organisms and RNA types. RNAcentral now also includes secondary (2D) structure information for >13 million sequences, making RNAcentral the world’s largest RNA 2D structure database. The 2D diagrams are displayed using R2DT, a new 2D structure visualization method that uses consistent, reproducible and recognizable layouts for related RNAs. The sequence similarity search has been updated with a faster interface featuring facets for filtering search results by RNA type, organism, source database or any keyword. This sequence search tool is available as a reusable web component, and has been integrated into several RNAcentral member databases, including Rfam, miRBase and snoDB. To allow for a more fine-grained assignment of RNA types and subtypes, all RNAcentral sequences have been annotated with Sequence Ontology terms. The RNAcentral database continues to grow and provide a central data resource for the RNA community. RNAcentral is freely available at https://rnacentral.org

UCL Discovery

Mutational Patterns in RNA Secondary Structure Evolution Examined in Three RNA Families

Author: Anuj Srivastava
AT Dandjinou
B Knudsen
C Lemieux
C Zwieb
C Zwieb
CR Woese
David Liberles
DR Maddison
E Rivas
ES Haas
ES Haas
F Ronquist
FJ Sun
GE Fox
H Ly
I Holmes
IL Hofacker
J Felsenstein
J Lingner
JA Hartigan
Jan Mrázek
JC Ellis
JD Podlevsky
JJ Cannone
JL Chen
JL Thorne
JL Thorne
JP Huelsenbeck
JR Cole
JS McCaskill
JW Brown
JW Brown
KP Williams
Liming Cai
LJ Collins
M Mccormickgraham
M Zuker
MT Dixon
NJ Savill
NM Krishnan
P Gueneau de Novoa
PD Williams
R Malmberg
RJ Klein
RK Bradley
RR Gutell
RR Gutell
Russell L. Malmberg
S Griffiths-Jones
S Smit
WP Maddison
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

The goal of this work was to study mutational patterns in the evolution of RNA secondary structure. We analyzed bacterial tmRNA, RNaseP and eukaryotic telomerase RNA secondary structures, mapping structural variability onto phylogenetic trees constructed primarily from rRNA sequences. We found that secondary structures evolve both by whole stem insertion/deletion, and by mutations that create or disrupt stem base pairing. We analyzed the evolution of stem lengths and constructed substitution matrices describing the changes responsible for the variation in the RNA stem length. In addition, we used principal component analysis of the stem length data to determine the most variable stems in different families of RNA. This data provides new insights into the evolution of RNA secondary structures and patterns of variation in the lengths of double helical regions of RNA molecules. Our findings will facilitate design of improved mutational models for RNA structure evolution

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

The Fragmented Mitochondrial Ribosomal RNAs of Plasmodium falciparum

Author: A Barta
A Ben-Shem
A Kairo
AB Vaidya
BF Lang
Bob Lightowlers
Bryan H. Sands
BT Wimberly
CA Milbury
CA Raabe
CH Slamovits
CJ Jackson
D Tu
DA Joy
DE Gillespie
DF Spencer
DH Rehkopf
DJ Conway
DJ Klein
DJ Klein
DT Dubin
DT Dubin
E Evguenieva-Hackenberg
F Schluenzen
GC Shukla
Germaine Tami
HJ Bernstein
J Brosius
J Cox-Singh
J Krungkrai
J Rabl
JA Mears
JA Mears
Jamie J. Cannone
JE Feagin
JE Feagin
JE Feagin
JE Feagin
Jean E. Feagin
JJ Cannone
JL Hansen
JL Hansen
JL Hansen
JL Hansen
JT Joseph
Jung C. Lee
K Hikosaka
K Hikosaka
K Suplick
Kevin J. Coe
KJ Dechering
M Fry
M Payne
M Turmel
Maria Isabel Harrell
MJ Gardner
MM Yusupov
MN Schnare
MN Schnare
MR Sharma
Murray N. Schnare
MV Rodnina
MW Gray
N Ban
N Ban
NR Voss
P Boer
P Chandramouli
P Chomczynski
P Nissen
P Nissen
P Sloof
PB Moore
R Kamikawa
R Okimoto
R Staden
RA Van Etten
RJ Wilson
Robin R. Gutell
RQ Lin
RR Gutell
RR Gutell
RR Gutell
S Jongwutiwes
S Krief
SL Perkins
T Powers
TA Steitz
TA Tatusova
TC White
TYK Heinonen
VF de la Cruz
VF de la Cruz
W Liu
W Trager
Y Ji
Publication venue: Public Library of Science
Publication date: 22/06/2012
Field of study

The mitochondrial genome in the human malaria parasite Plasmodium falciparum is most unusual. Over half the genome is composed of the genes for three classic mitochondrial proteins: cytochrome oxidase subunits I and III and apocytochrome b. The remainder encodes numerous small RNAs, ranging in size from 23 to 190 nt. Previous analysis revealed that some of these transcripts have significant sequence identity with highly conserved regions of large and small subunit rRNAs, and can form the expected secondary structures. However, these rRNA fragments are not encoded in linear order; instead, they are intermixed with one another and the protein coding genes, and are coded on both strands of the genome. This unorthodox arrangement hindered the identification of transcripts corresponding to other regions of rRNA that are highly conserved and/or are known to participate directly in protein synthesis.The identification of 14 additional small mitochondrial transcripts from P. falciparum and the assignment of 27 small RNAs (12 SSU RNAs totaling 804 nt, 15 LSU RNAs totaling 1233 nt) to specific regions of rRNA are supported by multiple lines of evidence. The regions now represented are highly similar to those of the small but contiguous mitochondrial rRNAs of Caenorhabditis elegans. The P. falciparum rRNA fragments cluster on the interfaces of the two ribosomal subunits in the three-dimensional structure of the ribosome.All of the rRNA fragments are now presumed to have been identified with experimental methods, and nearly all of these have been mapped onto the SSU and LSU rRNAs. Conversely, all regions of the rRNAs that are known to be directly associated with protein synthesis have been identified in the P. falciparum mitochondrial genome and RNA transcripts. The fragmentation of the rRNA in the P. falciparum mitochondrion is the most extreme example of any rRNA fragmentation discovered

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

Cryptic species in a well-known habitat: applying taxonomics to the amphipod genus Epimeria (Crustacea, Peracarida)

Author: A Barco
A Bortulus
A Luchetti
A-N Lörz
A-N Lörz
A-NE Lörz
ACA Boeck
ASPS Reboleira
B Chevreux
B Dayrat
BC Schlick-Steiner
C d’Udekem d’Acoz
C d’Udekem d’Acoz
C Hahn
C Havermans
C Havermans
C Mora
C Villacorta
CC Thompson
CO Coleman
CO Coleman
CS Bate
D Darriba
D Laslett
E Chevreux
EV Romanova
F Kilpert
FO Costa
FO Costa
G Lecointre
GG Fortes
GS Karaman
H Dreyer
H Ma
H Mahamat
H Xu
IA Arif
J Chun
J Felsenstein
J Geller
J Landschoff
J Lobo
J Rock
JC Meyran
JD Thompson
JE Frey
JJ Cannone
JJ Gillespie
JM Padial
JR Ellis
K Katoh
K Stephensen
K Stephensen
L Hendrich
M Aliabadian
M Bernt
M Hasegawa
M Kearse
M Kimura
M Ledoyer
M Morgulis
M Passamonti
M Richter
M Stoneking
M Westbury
MJ Raupach
MJ Raupach
MJ Raupach
MJ Raupach
MJ Raupach
ML Verheye
ML Verheye
MR Pie
N Saitou
N Stork
NM Kilgallen
NV Ivanova
O Folmer
O Hawlitschek
P Carpentieri
P Enequist
PDN Hebert
PDN Hebert
PG Moore
PKL Ng
R DeSalle
R Saiki
R Schuh
RC Edgar
RC Wilkerson
RR May
RR May
S Ehrich
S Kumar
S Matsukami
S Ratnasingham
S Ratnasingham
T Delić
T Knebelsberger
T Krapp-Schickel
T Lefébure
TA Hall
TJ Crease
TL Clark
TL Erwin
UO Hwang
V Khalaji-Pirbalouty
W Martin
Y Liu
Z Hou
Z Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

Taxonomy plays a central role in biological sciences. It provides a communication system for scientists as it aims to enable correct identification of the studied organisms. As a consequence, species descriptions should seek to include as much available information as possible at species level to follow an integrative concept of ‘taxonomics’. Here, we describe the cryptic species Epimeria frankei sp. nov. from the North Sea, and also redescribe its sister species, Epimeria cornigera. The morphological information obtained is substantiated by DNA barcodes and complete nuclear 18S rRNA gene sequences. In addition, we provide, for the first time, full mitochondrial genome data as part of a metazoan species description for a holotype, as well as the neotype. This study represents the first successful implementation of the recently proposed concept of taxonomics, using data from highthroughput technologies for integrative taxonomic studies, allowing the highest level of confidence for both biodiversity and ecological research

Crossref

Electronic Publication Information Center

Hochschulbibliothekszentrum des Landes Nordrhein-Westfalen (hbz)

Structural Constraints Identified with Covariation Analysis in Ribosomal RNA

Covariation analysis is used to identify those positions with similar patterns of sequence variation in an alignment of RNA sequences. These constraints on the evolution of two positions are usually associated with a base pair in a helix. While mutual information (MI) has been used to accurately predict an RNA secondary structure and a few of its tertiary interactions, early studies revealed that phylogenetic event counting methods are more sensitive and provide extra confidence in the prediction of base pairs. We developed a novel and powerful phylogenetic events counting method (PEC) for quantifying positional covariation with the Gutell lab’s new RNA Comparative Analysis Database (rCAD). The PEC and MI-based methods each identify unique base pairs, and jointly identify many other base pairs. In total, both methods in combination with an N-best and helix-extension strategy identify the maximal number of base pairs. While covariation methods have effectively and accurately predicted RNAs secondary structure, only a few tertiary structure base pairs have been identified. Analysis presented herein and at the Gutell lab’s Comparative RNA Web (CRW) Site reveal that the majority of these latter base pairs do not covary with one another. However, covariation analysis does reveal a weaker although significant covariation between sets of nucleotides that are in proximity in the three-dimensional RNA structure. This reveals that covariation analysis identifies other types of structural constraints beyond the two nucleotides that form a base pair

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Texas ScholarWorks

FigShare

Conservation of intron and intein insertion sites: implications for life histories of parasitic genetic elements

Abstract Background Inteins and introns are genetic elements that are removed from proteins and RNA after translation or transcription, respectively. Previous studies have suggested that these genetic elements are found in conserved parts of the host protein. To our knowledge this type of analysis has not been done for group II introns residing within a gene. Here we provide quantitative statistical support from an analyses of proteins that host inteins, group I introns, group II introns and spliceosomal introns across all three domains of life. Results To determine whether or not inteins, group I, group II, and spliceosomal introns are found preferentially in conserved regions of their respective host protein, conservation profiles were generated and intein and intron positions were mapped to the profiles. Fisher's combined probability test was used to determine the significance of the distribution of insertion sites across the conservation profile for each protein. For a subset of studied proteins, the conservation profile and insertion positions were mapped to protein structures to determine if the insertion sites correlate to regions of functional activity. All inteins and most group I introns were found to be preferentially located within conserved regions; in contrast, a bacterial intein-like protein, group II and spliceosomal introns did not show a preference for conserved sites. Conclusions These findings demonstrate that inteins and group I introns are found preferentially in conserved regions of their respective host proteins. Homing endonucleases are often located within inteins and group I introns and these may facilitate mobility to conserved regions. Insertion at these conserved positions decreases the chance of elimination, and slows deletion of the elements, since removal of the elements has to be precise as not to disrupt the function of the protein. Furthermore, functional constrains on the targeted site make it more difficult for hosts to evolve immunity to the homing endonuclease. Therefore, these elements will better survive and propagate as molecular parasites in conserved sites. In contrast, spliceosomal introns and group II introns do not show significant preference for conserved sites and appear to have adopted a different strategy to evade loss.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central